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Abstract 

A new way to design parameter estimators with enhanced performance is proposed in the paper. The procedure consists of two 
stages, first, the generation of new regression forms via the application of a dynamic operator to the original regression. Second, a 
suitable mix of these new regressors to obtain the final desired regression form. For classical linear regression forms the procedure 
yields a new parameter estimator whose convergence is established without the usual requirement of regressor persistency of 
excitation. The technique is also applied to nonlinear regressions with “partially” monotonic parameter dependence—giving rise 
again to estimators with enhanced performance. Simulation results illustrate the advantages of the proposed procedure in both 
scenarios. 


Index Terms 

Estimation, persistent excitation, nonlinear regressor, monotonicity 


I. Introduction 


A new procedure to design parameter identification schemes is proposed in this article. The procedure, called Dynamic 
Regressor Extension and Mixing (DREM), consists of two stages, first, the generation of new regression forms via the application 
of a dynamic operator to the data of the original regression. Second, a suitable mix of these new data to obtain the final desired 
regression form to which standard parameter estimation techniques are applied. 

The DREM procedure is applied in two different scenarios. Eirst, for linear regression systems, it is used to generate a 
new parameter estimator whose convergence is ensured without a persistency of excitation (PE) condition on the regressor. It 
is well known that standard parameter estimation algorithms applied to linear regressions give rise to a linear time-varying 
system, which is exponentially stable if and only if a certain PE condition is imposed—this fundamental result constitutes 
one of the main building blocks of identification and adaptive control theories To the best of the authors’ knowledge 

there is no systematic way to conclude asymptotic stability for this system without this assumption, which is rarely verified in 
applications. Relaxation of the PE condition is a challenging theoretical problem and many research works have been devoted 
to it in various scenarios, see e.g., Q-Q and references therein. Due to its practical importance research on this topic is of 
great current interest. 

The second parameter estimation problem studied in this article is when the parameters enter nonlinearly in the regression 
form. It is well known that nonlinear parameterizations are inevitable in any realistic practical problem. On the other hand, 
designing parameter identification algorithms for nonlinearly parameterized regressions is a difficult poorly understood problem. 
An interesting case that has recently been explored in the literature is when the dependence with respect to the parameters 
exhibit some monotonicity properties; see ID-IIl- Unfortunately, it is often the case that this property holds true only for 
some of the functions entering in the regression stymying the application of the proposed techniques. Our second contribution 
is the use of the DREM technique to “isolate” the good nonlinearities and be able to exploit the monotonicity to achieve 
consistent parameter estimation for nonlinearly parameterised regressions with factorisable nonlinearities— not imposing PE 
conditions. 

The remaining of the paper is organized as follows. The DREM technique is first explained with its application to linear 
regressions in Section In Section III DREM is used for nonlinear factorisable regressions with “partially” monotonic 
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parameter dependence. In both sections representative simulation examples are presented. Some concluding remarks and future 


research are given in Section IV 


Notation For x S K", \x\ is the Euclidean norm. All functions and mappings in the paper are assumed sufficiently smooth. 
For functions of scalar argument g : M. —>■ R®, g' denotes its first order derivative. For functions V : K” —> R we define the 
operator W := (^)^- Also, for mappings $ : R" x R'J —R" we define its (transposed) Jacobian matrix Vj;^{x,9) := 
[V<l>i(a:, 0),..., 0)]. For the distinguished element a;* G R" and any mapping F : R” —>■ R® we denote F!* := F(a;*). 


II. Consistent Estimation for Linear Regressions without PE 

In this section the DREM technique is applied to classical linear regressions. The main contribution is the removal of 
the—often overly restrictive—assumption of regressor PE to ensure parameter convergence. 


A. Standard procedure and the PE condition 

Consider the basic problem of on-line estimation of the constant parameters of the q-dimensional linear regression 

y{t) = m^{t)0, (1) 

wherry : R+ —R and m : R+ —R"^ are known, bounded functions of time and 0 G R'J is the vector of unknown parameters. 
The standard gradient estimator 

9 = Tm{y — 6), (2) 

with a positive definite adaptation gain F G R^^i yields the error equation 

9 =—Tm{t)m^ {t)9, (3) 

where 9 := 9 — 9 are the parameter estimation errors. It is well-known 0’ § that the zero equilibrium of the linear 
time-varying system is (uniformly) exponentially stable if and only if the regressor vector m is PE, that is, if 

J m{s)m7 {s)ds > 6Iq, (4) 

for some T,S > 0 and for all t > 0, which will be denoted as m{t) G PE. If m{t) ^ PE, which happens in many practical 
circumstances, very little can be said about the asymptotic stability of Q, hence about the convergence of the parameter errors 
to zero. 

Remark 1. In spite of some erroneous claims it is well known that the PE conditions for the gradient estimator presented 
above and more general estimators—like (weighted) least squares—exactly coincide IB- Since the interest in the paper is to 
relax the PE condition, attention is restricted to the simple gradient estimator. 

Remark 2. To simplify the notation it has been assumed above that the measurement signal y is one-dimensional. As will 
become clear below DREM is applicable also for the vector case. 


B. Proposed dynamic regressor extension and mixing procedure 

To overcome the limitation imposed by the PE condition the DREM procedure generates q new, one-dimensional, regression 
models to independently estimate each of the parameters under conditions on the regressor m that differ from the PE condition 

0 . 

The first step in DREM is to introduce y — 1 linear. Coo-stable operators Hi : Coo * € {1) 2,..., y — 1}, whose 

output, for any bounded input, may be decomposed as 


(•)/.w := m-m+^u 


(5) 


with ct is a (generic) exponentially decaying term. For instance, the operators Hi may be simple, exponentially stable LTI 
filters of the form 


Hr{p) 


OLi 

PF Pi' 


'When clear from the context, in the sequel the arguments of the functions ai‘e omitted. 


( 6 ) 
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with P and Ui ^ 0, Pi > 0; in this case et accounts for the effect of the initial conditions of the hlters. Another option 
of interest are delay operators, that is 

[HPm := {Pit-dP, 


where di G M+. 

Now, we apply these operators to the regressor equation ([T]| to get the hltered regressior0 

(7) 

Piling up the original regressor equation 0 with the q — 1 hltered regressors we can construct the extended regressor system 

Y,it) = (8) 

where we dehned : R+ —^ and Mg : M_|_ —>• as 


(9) 


Note that, because of the £oo-stability assumption of Hi, Y^ and are bounded. Premultiplying ([^ by the adjunct matrix 
of Me we get q scalar regressors of the form 

Ypt) = 



y 


mA 

n := 

. . . 

, Me := 



1 

1 

_1 




with i G q := {1,2,... ,q}, where we dehned the determinant of Me as 

Pit) := det{Me(t)}. 

and the vector Y : R+ —>• R'^ 

Y{t) := adi{Meit)}Yeit). 

The estimation of the parameters 9i from the scalar regression form ( fTOl i can be easily carried out vit 

Oi = l%p{Yi - p9i), i G q, 

with adaptation gains ji > 0. From ( fTO) ! it is clear that the latter equations are equivalent to 

9i = -^ip‘^9i, i G q. 


( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 


Solving this simple scalar differential equation we conclude that 

Pit) ^ £2 lim 9iit) = 0, (15) 

with the converse implication also being true. 

The derivations above establish the following proposition. 

Proposition 1. Consider the g-dimensional linear regression 0 where y : R_|_ — )■ R and m : R+ — )■ R'^ are known, 
bounded functions of time and 0 G R'J is the vector of unknown parameters. Introduce q — 1 linear, £oo-stable operators 
Hi : ^ C^, i G {1,2,... ,q - 1} verifying 0. Dehne the vector Ye and the matrix Me as given in (|^. Consider the 

estimator ( [T3| l with p and Yi dehned in ( [TT] ) and ( [T^ , respectively. The implication ( ffS] ) holds. 

□□□ 

Remark 3. It is important to underscore that for any matrix A G 

adi{A}A = det{A}/q, (16) 

even if A is not full rank, |T5}. 

Remark 4. If we take into account the presence of the exponentially decaying terms et in the hltering operations the error 
equation ( [T4l l becomes 

9i = -yip'^9^ + et, i G q. 

The analysis of this equation, which establishes ( [T5] l, may be found in Lemma 1 of 0- 

^To simplify the presentation in the sequel we will neglect the et terms, which will be incorporated in the analysis later. 
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C. Discussion 


Two natural question arise at this point. 

Ql. Is the condition (pit) ^ £2 weaker than m{t) G PE? 

Q2. Given a regressor m{t) ^ PE how to select the operators Hi to enforce the condition (p{t) ^ £ 2 ? 

Regarding these questions the following remarks are in order. 

Rl. It is important to recall that Q 

m{t) G PE m{t) ^ £ 2 . 


However, the converse is not true as shown by the example 

m{t) = 


1 


Vl + f’ 

which is neither square integrable nor PE. 

R2. Consider the regressor m{t) := [sin(f) cos(f)]^ and the operator 

c(p+l) 

+P + 2 

where c > 0. Note that for unit frequency the operator H provides zero phase shift and the magnitude gain c. Thus 
mif{t) = csin(f), m 2 /(t) = ccos(f) and 


Meit) = 


sin(f) cos(f) 
csin(f) ccos(f) 


Obviously, m{t) G PE, but det{Me(f)} = 0 and <p{t) G £ 2 . 

R3. Erom dehnition Q it is clear that the PE condition is a requirement imposed on the minimal eigenvalue of the matrix 
illustrated by the equivalence 


•^min 


TO(s)m^(s)c?s| > (5 > 0 


m{t) G PE, 


where Aminl’} denotes the minimal eigenvalue. On the other hand, the condition (p{t) ^ £2 is a restriction on all 
eigenvalues of the matrix M^. Indeed, this is clear recalling that the determinant of a matrix is the product of all its 
eigenvalues and that for any two bounded signals a, b : K+ —K we have 


a{t)b{t) ^ £2 a{t) ^ £2 and b{t) ^ £ 2 . 

Consequently, a necessary condition for parameter convergence of the estimators ( [T3] l is that all eigenvalues of the matrix 
Me are not square integrable. 

Although the remarks R1-R3 do not provide a dehnite answer to the question Q2, they illustrate the fact that the parameter 
estimation procedure proposed in the paper gives rise to new convergence conditions that radically differ from the standard 
PE requirement. 


D. An example 

To provide some (partial) answers to the question Q2 above let us consider the simplest case of q = 2 with m = col(mi, m 2 ). 
In this case 

(p = mim2f - rnifm2- (17) 

The proposition below identihes a class of regressors m{t) ^ PE but p{t) ^ £2 for the case of H a simple LTI hlter. 
Proposition 2. Dehne the set of differentiable functions 

Q := {g : M_|_ —t M | g{t) G £ 00 , g{t) G Coo, 9{t) ^ ^ 2 , 

lim g{t) = lim git) = 0} 







5 


For all g G G the regressor 


Let the operator H be defined as 


m{t) = 


1 

9 + 9 


PE. 


[^(•)](i) = 


1 


P + 1 


(•) 


[t). 


The function (p defined in ( [T7] ) verifies (p{t) ^ £ 2 - 

Proof: The fact that m(t) ^ PE is obvious because limt_>.oo m 2 {f) = 0. Now, we have that mi/ = 1 + ej and from the 
filter equations we get 

7712 / = —m2f + m2- 

On the other hand, from the definition of m we have 


9 = -9 + m 2 - 


Substracting these two equations we get 


— {m 2 f-g) = -{m 2 f -g), 


consequently m 2 / = g + Ct- Replacing these expressions in ( [TtI i yields 

(j) = 7772/ - (1 + et)m2 

= (ff+ et) - (1+ et)(5 + 5) 

= —g + ^t, 

where we have used the fact that g{t) G £00 and 9 (i) G -Coo to obtain the last equation. This completes the proof 
Remark 5. An example of a function g G G is 

9(i) = 


sinf 

(iTf)l 


The corresponding regressor is 


m{t) = 


1 


sin t+cqs t 


sin t 


(18) 


2(l+t)2 

Remark 6. The choice of a = /3 = 1 for the proposed operator H is made without loss of generality, as a similar proposition 
can be established for any exponentially stable LTI filter. The choice of the filters is, then, a degree of freedom to verify the 
conditions ^ £ 2 . 


E. Simulation results 

We first evaluate the performance of the classical parameters estimator with m{t) given by Erom the analysis of 
Subsection II-A we know that the LTV system ([^ is stable, but it is not exponentially stable since m{t) ^ PE, and PE is a 


necessary condition for exponential stability. 

The transient behavior of the parameter errors 9{t) with T = 7/2 and 9 = col(—3,3) is shown in Eig.j^for 0(0) = col(3, —3), 
7 = 3 and 7 = 10. It is worth noting that it is not possible to conclude from the simulations whether 9{t) converges to zero 
asymptotically or not. The plots show that convergence has not been achieved even after a reasonably long period of 500. 
The graphs also show that increasing 7 that, in principle, should speed-up the convergence, makes the situation even worse, 
cf. Eig. [^(a) and (b). In Eig. |^we show the integral curves for 7 = 3 for initial conditions taken in a disk. If the adaptation 
gain is taken as E = diag{ 7 i, 72 } it is possible to improve the transient performance, but this requires a time-consuming, 
trial-and-error tuning stage that is always undesirable. 

Next we study performance of the DREM estimator ( [T3| ) with the same m{t) and 9 = col(—3,3). The transient behavior of 
9{t) is given in Eig. |^for 0(0) = col(3, —3), 712 = 3 and 71 2 = 10. The integral curves, for initial conditions taken in a disk, 
for 7 i 2 =3 are given in Eig. The simulations illustrate significant performance improvement both in oscillatory behavior 
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(a) 7 = 3 (b) 7 = 10 

Fig. 1 : Transient performance of the parameter errors 9{t) for the gradient estimator (|^ with m{t) given by ( fTS] ), 0(0) = 
col(3, —3) and T = 7 / 2 , 7 = 3,10. 


<D 



Fig. 2: The integral curves of the parameter estimator 0 with m{t) given by (HD), r = 3/2 and initial conditions taken in a 
disk. 


and in convergence speed—notice the difference in time scales. Moreover, tuning of the gains 7 ^ in the DREM estimator is 
straightforward. For example, given « 0.78, where (j) is defined in ( [TtI i, for 7 i = 3 one obtains 

0,(10) « e-3'O-7807o) « 0.09 0,(0), 

and for 7 ^ = 10 

0,(10) « e-i° °-™0i(O) « 0.00040i(O), 
which coincides with the behaviour observed in Figs. |^(a) and (b). 

III. Parameter Estimation oe “Partialey” Monotonic Regressions 

In this section we propose to use the DREM technique for nonlinearly parameterised regressions with factorisable nonlin¬ 
earities, where some —^but not all—of the functions verify a monotonicity condition. The main objective is to generate a new 
regressor that contains only these “good’ nonlinearities. 

We consider factorisable regressions of the form 

y(t) = m(f)i/;(0), (19) 

where y : K_|. —>■ K" and m : M+ contain measurable functions, the mapping —>■ is known and 0 G K'J is 

the unknown parameter vector. 
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time [s] 


time [s] 


(a) 71,2 =3 


(b) 71,2 = 10 


Fig. 3: Transient performance of the parameter errors 9{t) for the DREM estimator ( [T3] ) with m{t) given by ( [T8] l, e{0) = 
col(3, —3) and 71^2 = 3,10. 



Fig. 4: The integral curves of the DREM parameter estimator d with m{t) given by (El, 7 i 2 = 3 and initial conditions 
taken in a disk. 


Eor the sake of ease of exposition before presenting a general result, which is postponed to Subsection III-C in the next 
two subsections we apply the DREM technique to two representative examples. It is clear that the nonlinear regression El 
can be “transformed” into a linear one defining the vector p := '0(6*) to which the standard gradient estimator 

ry = rm^(y — mp) (20) 

can be applied. In Subsection |III-D| a brief discussion on the disadvantages of overparameterisation is presented. To extend 


the realm of application of the DREM technique in Subsection III-E we consider the case of nonlinear dynamical systems 
that depend nonlinearly on unknown parameters—via non-factorisable nonlinearities. A procedure to generate a factorisable 
regression of the form El, which is valid in a neighbourhood of the systems operating point, is given. 


A. First example 

To illustrate the use of DREM for nonlinearly parameterised factorisable regressions let us first consider the simplest scalar 
case of n = 1, p = 2 and q = 1. The regression El becomes 


y(f) = mi(f) m 2 (f) 


V'i(6») 

' 02 ( 6 ') 


( 21 ) 































where y : IR_|_ —>■ K, 111 ^ : IR+ —>■ M and '0^ : K —)■ K, for i = 1,2. Assume that '!/'i(^) is strongly monotonically increasing, 
that is, 

tp[{d) > Po> 0. 


In this case, the function 'ipi verifies CD 

(a — 5)[V'i(a) — tpi{b)] > pi{a — 6)^, Va, 6 S K, 

for some pi > 0. 


( 22 ) 


The goal is to generate a new regression where, similarly to 110|-| 121, the property p2l ) can be exploited. Following the 
iREM procedure described in Subsection we apply an operator H to (RTli and pile-up the two regressions as 


'y(i)' 


mi(f) m 2 (t) 

tpi{e) 

y/(0_ 





Multiplying on the left the equation above by the row vector [m 2 / — m 2 ] we get the desired regression involving only tpi, 
namely. 


where we defined the signals 


The estimator 


with 7 > 0, yields 


Y := m2 /y-m2y/ 

$ := m2/mi—m2mi^. 

0 = 7 $[Y( 0 )], 


(23) 

(24) 


0 = 

To analyse the stability of this error equation consider the Lyapunov function candidate 


V{0) = -6\ 
27 


whose derivative yields 


< -pi$2(0_0)2 

= -2pi7$V, 

where the first inequality follows from ( |22l l. Integrating the previous inequality yields 

V{t) < e-^PiT'/o 

which ensures that 6{t) —)■ 0 as f —)■ c» if 4)(f) ^ £ 2 - 
As an example consider the regression 

y{t) = mi(f) (0 - e“®) + m 2 (f) cos(0) 


p 

1 

’ 0 -e-® ' 

mi(f) 

m2(t) 

cos(0) 


=: m(<)V'(6»), 

which clearly satisfies condition ( |2^ . The vector m(f) ^ PE stymies the application of overparameterisation. Moreover, since 
the mapping only locally injective, a constrained estimator of rj is required to recover the parameter 0. 

The proposition below identifies a class of regressors m(f) ^ PE but $(f) ^ £2 for a simple delay operator. 


Proposition 3. The regressor 


m(f) = 


sin(t) 

^i-l-27r 


PE. 
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Fig. 5; Transient behaviour of the errors fj{t) for the over-parameterized parameter estimator ( |20l i for different adaptation 
gains; f}(0) = 0 and t)(0) = —'0(0) = —0(1) ~ [—0.63, —0.54]^ 


Let the operator Tf be the delay operator, that is, 

The function $ defined in (|2^ verifies $(f) ^ £ 2 - 



Stt 

Y 


Proof: The fact that m(f) ^ PE is obvious because mi(t) 
Now, the function $ defined in ( |2T| takes the form 

sin(f) 


0 . 


Ci>(f) = 


sin(f — d) 
-ft+ 2 tt y/t + 2TT — d 


Whence 


$2(f) = 


/ sin(t) sin(f — d) 

V -ft + 27r + 2tt — d 

sin^(f) shf {t — d) 


t + 2 tt 
sin^(f) 
f + 27r 

— 2 cos(d) 
+ sin(d)- 


f + 27r — d 
sin^(f — d) 
t + 2 t: — d 
sin^(f) 


- 2 


sin(f) sin(f — d) 


y/t + 2 tt\/ t + 2TT — d 


ft + 2Tr\/t + 2Tr — d 
sin(2f) 


ft + 2TTft + 27r — d’ 

where some basic trigonometric identities have been used to derive the third identity. Note that the first three right hand terms 
of the last identity are not integrable. Since cos(d) < 0 in the admissible range of d the sum of these terms is also not 
integrable. On the other hand, the last right term verifies 

sin(2f) 


sin(d) 


ft + 27: ft + 2-K — d 


dt < 00 . 


Thus, $(f) ^ £ 2 - ■ 

Simulations of the overparametrized estimator ( |20l i with 0 = 1 are given in Fig. The simulations exhibit poor convergence, 
which is expected since m{t) f PE. After some trial-and-error gains tuning it is possible to improve the transient performance 
as shown in Fig. |^b). However, as pointed out above, it is not possible to reconstruct 0 since the function cos(0) is not 
injective for 0 G M. 

Simulations of the DREM estimator ( |24l l with 0 = 1 are shown in Fig. The estimation error 0(f) converges to zero and 
the tuning gain 7 allows to accelerate the convergence. 
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Fig, 6 : Transient behaviour of the error 6{t) transients of the DREM parameters estimator (|24li with different gains; 0(0) = —1. 


Remark 7. If the strong monotonicity condition ( [22| ) is relaxed to strict (or plain) monotonicity we can only ensure that 
V = —^^{6 — 0)['0i(0) — V’i(^*)] ^ 0 strict (PE-like) conditions must be imposed on $(f) to ensure convergence. 

Remark 8. Obviously, DREM is also applicable if ipi is monotonically decreasing, instead of increasing, or if it is that 
enjoys the monotonicity property. 


B. Second example 

Let us consider now the vector case with n = 2, p = 3 and q = 2. The regression ([T^ becomes 


yiW 


mu(f) 

mi2(f) 

nii3(f) 

y2(i) 


m2i(f) 

m22(f) 

m23(f) 


-1 

ipi{e) 


Md) 




(25) 


where : IR+ —)■ K, xnij : IR+ — )• K and ijjj : — )■ M, for i = 1, 2 and j = 1,2,3. To estimate two parameters we assume 
that two functions ipj satisfy a monotonicity condition—without loss of generality, we assume they are ^^2 and ^ 3 , and we 
define the “good” vector 

iltg := col(V^2, 

such that 


More precisely, we assume there exists a constant positive definite matrix P G 

PVlPgiO) + [VlPgi0)]^P > P 0 I 2 > 0 . 

As shown in 116|, see also | fT0| , the inequality above implies the existence of pi > 0 such that 

(a - b)^ P [ipg{a) - ipg{b)] > pi\a - 6 ^, Va,5 S K'^, 


(26) 


(27) 


which is the strong P-monotonicity property exploited in GD- 

The first task of DREM is to generate a two-dimensional regression that does not contain ipi. Eirst we apply a filter to 
either one of the rows of (|25]l, say to the first one, and pile-up the result as 


yi{t) 


mu(f) mi2(f) mi3(f) 


ipi{9) 

y2{t) 

= 

m2i(f) m22(f) m23(t) 


1p2{d) 

yifit) 


mu/(f) mi 2 /(f) mi3/(f) 




Next we multiply the last equation on the left by the matrix 

m2i(f) -mii(f) 0 

0 -mii(f) 

which is the left annihilator of the first column of the extended regressor. Defining the 2x2 regression matrix 


$ := 


m2i 

mu/ 


-mu 

0 


-1 

mi2 

mi3 


m22 

m23 


mi2/ 

mi3/ 
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and 


we get 


Yi := 


m2i -mil 0 

mil/ 0 -mil 


-1 

yi 


y2 


yi/_ 


Yi(t) = $(i)i/,g(0). (28) 

Multiplying ( |28l ) on the left by adj{$(f)} and defining Y(f) := adj{$(f)}Yi(t), we obtain the desired regression form 

Y{t)=det{m}^g{d)- (29) 

We propose the estimator 

0 = det{$}rP[Y-det{$}V'g(0)], (30) 

with r G a positive dehnite gain matrix. Using ( |29| ) the error equation is 

e = -det^{<i>}rP['lPg{9) - i^g{9)]. 


To analyse its stability define the Lyapunov function candidate 


1 


T T^ — 1 / 


whose derivative yields 


V{9) = -9'T 


U = -dee{d>}{9-9)^P[^g{9)-^g{9)] 

4.2 r^l 2pi 


(31) 


< —deU^jd)} 


Amax{r} 


If the matrix $(f) is full rank and det^{‘l>(f)} > k > 0, then 


U< -- 


2kpi 


V. 


-V, 


Ainax{r} 

and the analysis above ensures exponential stability of the error equation. 

Otherwise integrating the inequality yields 

V(t) < e" Wxtr} /o 

which ensures that 9{t) —?► 0 as f —?► oo if det{$(f)} ^ £ 2 - 

Remark 9. From the derivations above it is clear that there are many degrees of freedom for the definition of the regressor 
matrix $, e.g., the choice of the operator H, the rows to be filtered and mixed. The hnal objective is to impose some properties 
to the function det{$(f)}, ideally, that it is (uniformly) bounded away from zero—that yields exponential stability. 

Remark 10. In G) it is shown that the local verification of the monotonicity condition ( |2^ reduces to a linear matrix 
inequality (LMI) test provided some prior knowledge on the parameters is available. More precisely, assume 9 G & C K'^, 
with 

0 := {0 G I 0* G C ffi}- 


The quadratic approximation of the mapping ipg{9) verifies (26i if and only if the LMI 


>0, i = 1, . . . , (2«)« 


is feasible, where the vectors Vi G Q are computable from the vertices of 0. 
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C. A general result 

In this subsection a generalization of the previous examples is presented. We make the following assumption. 

Assumption 1. Consider the regression form ( [T^ . There are q functions ipi that, reordering the outputs j/^, we arrange in a 
vector ipg W^, verifying 

PVV’s(0) + > Polq > 0, 


for some positive definite matrix P S 

Consistent with Assumption we rewrite ( [T^ as 

yN{t) = mg(t) 

R" is the reordered output vector, nig 


mb(t) 


o-q 


y^b(&) 

nnxg 


nib 


pnx(p-q) 




(32) 


and 


where yjv : 
i/if, : —>■ K 

From the previous two examples we have learned that DREM must accomplish two tasks, on one hand, generate a regression 
without nif,. On the other hand, to be able to relax the PE condition, the new regressor matrix should be square (or tall). Given 
these tasks, to obtain a sensible problem formulation the following assumption is imposed. 


Assumption 2. The regression ( p^ satisfies 

q < p (33) 

n < p. (34) 

If ( |33| does not hold all functions ijji, i = 1,... ,p, satisfy the monotonicity condition and there is no need to eliminate 
any one of them. On the other hand, if is not satisfied a square regressor without the “bad” part of the regressor ipb can 
be created without the introduction of the operators Hi. Indeed, if n = p the matrix ruf, is tall and it admits a full-rank left 
annihilator : IR+ —> IR"?^". Moreover, the new regressor matrix m^tiig is square. A similar situation arises if n > p. 

Eollowing DREM we introduce n/ operators, apply them to some rows of ([3^ and pile all the regression forms to get 


Yn 


Mg Mb 





_M0)_ 


where we defined the matrices Mn 


"+"/)x9, Mb : IR+ ^ IE 

, Mb := 


Mg := 


m„ 


mg/ 


{n+nf)x{p-q) 

mb 

mb/ 


(35) 


(36) 


To select the number n/ of operators we notice that the matrix to be eliminated, that is Mb, is of dimension (n + n/) x {p — q). 
Therefore, to have a left annihilator for it with q rows, which is needed to make the new regressor square, we must fix 
n/ = p — n. Dehne 

(37) 


$ := M^Mg. 


Multiplying on the left by adj{$}M^ the equation ( |35| l yields the desired regressor form 

Y = det{$}V'g(0), 


where 


Y := adj{$}M^; 


Yn 

VNf 


(38) 


We are in position to present the main result of this section, whose proof follows from the derivations above. 


Proposition 4. Consider the nonlinearly parameterised factorisable regression ( [3^ satisfying Assumptions [T] and Introduce 
p — n linear, £oo-stable operators Hi : i G {1,2,.. . ,p — n} verifying Define the matrices Mg, Mb as given 

in ( [3^ . Consider the estimator ( |30| ) with $ and Y defined in ( |J7l i, ( |38] l and M^ : ]R_|_ —> a full-rank left annihilator of 

Mb. The following implication holds 

det{$(f)} ^ £2 lim l^(^)l = 0. 

t—>-oo 

Moreover, if det^{$(f)} > k > 0, then \0(t)\ tends to 0 exponentially fast. 
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D. Discussion on overparaineterisation 

As disused above the nonlinear regression ( [T^ can be “transformed” into a linear one defining the vector rj = 'ijj{9). Classical 
parameter adaptation algorithms can then be used to estimate this new parameter vector. However, overparametrization suffers 
from the following well-known shortcomings 0^ 0: 

(i) Performance degradation, e.g., slower convergence, due to the need of a search in a bigger parameter space—indeed, 
usually q<p. 

(ii) The more stringent conditions imposed on the reference signals to ensure the PE needed for convergence of the parameters 
V- 

(iii) Inability to recover the true parameter 9 —except for injecting mappings. This stymies the application of this approach in 
situations where the actual parameters 9 are needed. 

(iv) Conservativeness introduced when incorporating prior knowledge in restricted parameter estimation. 

(v) Reduction of the domain of validity of the estimates stemming from the, in general only local, invertibility of the mappings 

r/i. 


E. Generating separable regressions for nonlinear dynamical systems 

Consider the general case of a nonlinear system whose dynamics depend nonlinearly on some unknown parameters, say, 

x = Fo{x) + Fi{x,9), (39) 

where Fq : K" —>■ M", Fi : K” x —>• K" are known functions, x : K+ —>■ K" is the measurable systems state and 9 
are the constant unknown parameters. It is assumed that the system evolves around some operating point cc* G K". 

Applying the classical filtering technique GZ) it is possible to generate a state-dependent regression that can be used for 
parameter estimation. Indeed, defining 

y := 

S(x,9) := 

and neglecting exponentially decaying terms, we get the nonlinearly parameterized regression 

y = E(x, 9) (40) 

where y : M+ —)• K” and ^ ; K" X K'J —K". Estimation of the parameters at this level of generality is a daunting task. To 
simplify it we propose to linearize the function 5 around x* to obtain a separable nonlinear regression as done in the lemma 
below. 

Lemma 1. The first order approximation (around x*) of the nonlinearly parameterized regression ( |40l l is given by ( [T9| l, where 
m : K+ —)■ and r/i : > K?', with p := n + n^, are given by 

0 ... 0 1 ' 

x^(f) ... 0 

0 ... x^it) 

where we have introduced the notation x := x — x*. 

Proof: The proof follows trivially rearranging the elements of the first order approximation (around x*) of 5, namely 

5(x, 9) = 5(x*, 6>) + VjS(x*, 0)(x — x*) + C9(|x — x*p), 


m(f) := 


x^{t) 


-^{ 9 ) := 


V,5i*{0) 
^ {9) 


^ X -^Fo(x) 


p + 1 p + 1 


and using the definition of x. 
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Next, from the derivations of the subsection III-C if at least q elements of the vector V'(^) enjoy monotonicity, the DREM 
approach can be applied. 


Remark 11. From the structure of the regression matrix given in Lemma [T] the following equivalence holds true 


m(f) S PE 


1 

x{t) 


e PE. 


Clearly, the PE condition is not satisfied in regulation tasks, when x{t) tends to zero. 


Remark 12. We have generated the regressor form ( |40| l applying the classical filtering technique. As shown in pO) , fTT] it is 
possible to design parameter estimators directly for the system ( [39] l using immersion and invariance techniques fTS) . Also, we 
have considered the case of a closed system, that is, without input. The same derivations apply for systems with an external 
input signal. 


IV. Concluding Remarks and Future Research 

A procedure to generate new regression forms for which we can design parameter estimators with enhanced performance 
has been proposed. The procedure has been applied to linear regressions yielding new estimators whose parameter convergence 
can be established without invoking the usual, hardly verihable, PE condition. Instead, it is required that the new regressor 
vector is not square integrable, which is different than PE of the original regressor. For nonlinearly parameterised regressions 
with monotonic nonlinearities the procedure allows to treat cases when only some of the nonlinearities verify this monotonicity 
condition. Similarly to the case of linear regressions, convergence is ensured if the determinant of the new regressor is not 
square integrable. 

The design procedure includes many degrees of freedom to verify the aforementioned convergence condition. Current 
research is under way to make more systematic the choice of this degrees of freedom. It seems difficult to achieve this end 
at the level of generality presented in the paper. Therefore, we are currently considering more “structured” situations, for 
instance, when the original regression form comes from classes of physical dynamical systems or for a practical application. 
Preliminary calculations for the problem of current-voltage characteristic of photovoltaic cells—which depend nonlinearly on 
some unknown parameters—are encouraging and we hope to be able to report the results soon. 
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